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Abstract 

We study the online convex covering problem and online convex packing problem. The 
(offline) convex covering problem is modeled by the following convex program: 

min f(x) s.t. Ax > 1 
xgR!J; 

where / : H." i—>■ M+ is a monotone and convex cost function, and A is an m x n matrix with 
non-negative entries. Each row of the constraint matrix A corresponds to a covering constraint. 
In the online problem, each row of A comes online and the algorithm must maintain a feasible 
assignment x and may only increase x over time. The (offline) convex packing problem is 
modeled by the following convex program: 

max X:”Li V] - 9(A^y) 
yeRy 

where g : R" i—>■ R+ is a monotone and convex cost function. It is the Fenchel dual program of 
convex covering when g is the convex conjugate of /. In the online problem, each variable yj 
arrives online and the algorithm must decide the value of yj on its arrival. 

We propose simple online algorithms for both problems using the online primal dual tech¬ 
nique, and obtain nearly optimal competitive ratios for both problems for the important special 
case of polynomial cost functions. For instance, for any convex homogeneous polynomial cost 
functions with non-negative coefficients and degree r, we introduce an 0(t log n)'^-competitive 
online convex covering algorithm, and an 0(r)-competitive online convex packing algorithm, 
matching the known H(Tlogn)'^ [5] and ^(t) [21] lower bounds respectively. 

There is a large family of online resource allocation problems that can be modeled under this 
online convex covering and packing framework, including online covering and packing problems 
(with linear objectives) [14], online mixed covering and packing [4], and online combinatorial 
auction [7, 9]. Our framework allows us to study these problems using a unified approach. 
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1 Introduction 


Due to its wide practical applications, online optimization has received much attention in the 
communities of computer science, operation research, and applied mathematics. In particular, in 
an online resource allocation problem, there is a set of resources and a set of requests, each of which 
demands certain amount of each resource. The requests arrive one by one online, and the online 
algorithm must decide whether to serve a request and/or how to serve the request immediately on 
its arrival without information of future requests. 

Most previous work on online resource allocation problems can be divided into two classes: 
online covering problems, where the online algorithm must satisfy all (covering) requests with an 
objective of minimizing a linear cost function for using the resources; and online packing problems, 
where the online algorithm aims to maximize the total number of served requests (or, more generally, 
the total value generated from the served requests) subject to fixed resource capacities.^ However, 
the two extreme cases of either having linear resource costs or having fixed resource capacities (i.e., 
zero-infinity step cost function) are far from capturing the whole story in practical scenarios. For 
many typical resources, such as computing cycles, memory, labor, oil, etc., additional resources 
can be obtained at increasing marginal costs, which cannot be modeled in the traditional online 
covering and packing framework. This observation motivates the natural question of how to solve 
online resource allocation problems with general convex cost functions. 

On the other hand, there have been a few spontaneous studies on problems with mixed covering 
and packing constraints, e.g., online facility location [1], online unrelated machine scheduling [2, 3], 
and the recent work by Azar et al. [4] for an attempt to develop a unified framework for such 
problems. Problems with both covering and packing constraints are traditionally considered to be 
more difficult than problems with only one types of constraints. Indeed, most known algorithms for 
mixed covering and packing problems and their analysis are more complicated compared to their 
counterparts in covering or packing problems. 

1.1 Our Contributions 

The first contribution of this paper is a conceptual one. We propose an online convex covering and 
packing framework that allows us to study a large family of online resource allocation problems 
with general convex cost functions under a unified framework. Interestingly, we can also remodel 
the mixed covering and packing problem by Azar et al. [4], which was originally modeled using 
linear programs, as a pure covering problem, but with a convex cost function. Then, we are able 
apply our unified framework to the problem after remodelling and obtain a better competitive ratio 
with a simpler algorithm and analysis. 

Our main technical contributions are two simple online algorithms for online convex covering and 
online convex packing respectively. Our algorithms achieve nearly optimal competitive ratios for 
both problems for the important special case of polynomial cost functions (that are convex and have 
non-negative coefficients), where the competitive ratio is parameterized by the maximum degree of 
the cost function. For online convex covering, we also show that our algorithm is competitive with 
an additive error for general cost functions. 

We focus on polynomial cost functions for two reasons. First, they are the most common type 
of cost functions when we model practical problems using convex programs. For example, if we 
want to model energy minimization in online scheduling, all known theoretical studies (e.g., [22]) 

^Note that the other two possible combinations, i.e., covering problems with fixed resource capacities and packing 
with linear resource costs, have trivial 1-competitive online algorithms. 
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consider polynomial power functions. There are also a large family of problems, e.g., the mixed 
covering and packing problem [4], whose objective can be modeled as the £p-norm of a set of linear 
or polynomial functions; hence, the p-th power of the objective is a polynomial. Second, there 
are known online optimization problems for which polynomial cost functions are the “worst” cost 
functions which still admit bounded competitive ratios. For example, Devanur and Huang [15] 
(Thm. 3.2 and 5.1) showed that the problem of minimizing flow-time plus energy admits bounded 
competitive ratios if and only if there is a parameter r such that the power function is “at most as 
convex as” degree-r polynomials. We next discuss our framework and results. 


Online Convex Covering: In an online covering problem, there are n resources and let Xi denote 
the amount of resource i that are used by the algorithm; there are m (covering) requests that arrive 
online, where each request j is specified by a vector aj = {aji,... ,ajn), and has the form of 
^ 1- The goal is to minimize the resource costs subject to satisfying all requests. Unlike 
the original online covering problem, where the cost function is linear in x, in the online convex 
covering framework, we allow a general convex cost function / such that producing Xi amount of 
each resource i for all i = 1,... ,n incurs a cost of f{x). Clearly, the design of online algorithms 
and the corresponding competitive analysis will depend on properties of the cost function /. For 
polynomial cost functions, we show the following result. 

Theorem 1.1 (Online Convex Covering) Suppose the cost function f is a convex multivariate 
polynomial with non-negative coefficients and maximum degree r. 

(a) If f is a homogeneous polynomial, then there is an O {t log nY-competitive online algorithm, 
(h) If f is a sum of N monomials, then there is an 0{T\ogNY-competitive online algorithm. 

The above result is nearly optimal due to a lower bound by Azar et al. [5, Theorem 3] showing 
that if the objective is the £p-norm of a set of linear cost functions, the competitive ratio of any 
online algorithm is at least H(plogn) (for sufficiently large n). To compare with our results, we 
set T = p and take the p-th power of the .^p-norm to convert it into a degree-r homogeneous 
polynomial such that the above lower bound becomes H(rlogn)'^. For general cost functions, we 
show the following. 


Theorem 1.2 Suppose the cost function f is at most as convex as degree-r polynomials in the sense 
that {x,Vf{x)) < Tf{x). Suppose in our online convex covering algorithm, the primal vector x is 
initially set to L, and let U be an upper bound of x (e.g., Ui = 




-); denote p. = maxjgj^’ Ti 


m Li 


Then, our online convex covering algorithm is 0 {t In pY'^competitive with an additive error of f{L). 


As a concrete application of our framework and our online convex covering algorithm, we obtain 
an improved competitive ratio for the online mixed packing and covering problem. 

• .£p-norm of linear functions: In fact, let us consider a more general problem. Consider I 
vectors Ck G {k G [1]), and define A : M” —)• such that for k G [Z], Xk{x) := {ck,x). For 

some p > 1, the cost function is ||A(x)||p, which is the .£p-norm of the /-dimensional vector A(T). 
This cost function is considered in Azar et al. [5], and the online mixed covering and packing 
problem [4] is the special case when p = oo. 


Typically, the problem is characterized by a sparsity parameter d, which in our case is the max¬ 
imum number of non-zero coordinates in each of the vectors Cfc’s.^ For finite p > 1, we consider 

^The notions of sparsity in different work are somewhat different; see Section 1.4 for details. 
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the polynomial f{x) := ||A(x)||p, whose degree is at most p. Observe that since each vector Ck 
has at most d non-zero coordinates, each term Afc(x)^ introduces at most d variables. Hence, 
f{x) is a degree-p homogeneous polynomial that depends on at most Id variables. However, as 
we shall see in Corollary 3.1, a careful application of Theorem 1.1 can achieve an 0{plogd)P- 
competitive online algorithm with respect to the cost function /. Taking the p-th root, the 
competitive ratio for the ip-norm is 0{plogd), simplifying the competitive analysis of Azar et 
al. [4, 5] by removing the logarithmic dependency on the ratios of non-zero coordinates in Uj’s 
and Cfc’s. 

Since the dimension of vector A(x) is I, it is well known that its ^co-norm can be approximated 
within constant factor by the £iogz-norm. Hence, using p = log I in our framework, we obtain a 
competitive ratio of O{log I log d) for the online mixed covering and packing problem. 

Online Convex Packing: In an online convex packing problem, there are n resources and let Zi 
denote the amount of resource i that is used by the algorithm. There are (packing) requests that 
arrive online, where each request is specified by a vector Sj = {aji, ..., ajn) and aji is the amount 
of resource i needed for serving one unit of request j. Unlike the original online packing problems, 
where the resources are subject to fixed capacities, we will allow a general convex cost function g 
such that producing zt amount of each resource i for alH = 1,..., n incurs a resource cost of g{z). 
The goal is to maximize the total units of served requests minus the resource costs. The design 
of online algorithms and the corresponding competitive analysis will depend on certain properties 
of the cost function g. It turns out that online convex packing is the Fenchel dual problem of the 
online convex covering, when g is the convex conjugate of /. See Section 2 for a brief discussion of 
convex conjugate and Fenchel duality. 

Theorem 1.3 For any polynomial cost function g that is convex, and has non-negative coefficients 
and maximum degree t >1, our online convex packing algorithm is 0{t)- competitive. 

The above positive result also extends to a more complicated type of packing problems known 
as combinatorial auction with production cost. 

• Online combinatorial anction with production cost: Online combinatorial auction is a 
natural extension of online packing. In an online combinatorial auction, each online request 
j is a buyer associated with a value function Vj, where Vj{yj) is buyer j’s value if he gets yji 
units of resource i, i £ [n], in the auction. In other words, there are multiple ways of serving 
each request j; each way generates a different value and consumes a different amount of each 
resource. The goal is to maximize the total value of the final allocation minus resource costs. 
This problem was first studied in the fixed capacity setting [7, 12] and later in a setting with 
separable resource costs [9, 21], i.e., the cost can be written as g{z) = 

It is straightforward to extend our our results from online convex packing to online combinatorial 
auction with production cost. We obtain an 0(r)-competitive online algorithm for any convex 
polynomial cost functions with non-negative coefficients and maximum degree r. This ratio is 
tight due to a lower bound result by Huang and Kim [21]. 

1.2 Our Techniques 

Our technique is based on the online primal dual framework for solving linear programs (see, e.g., 
[13] for a survey), and its recent extensions to convex programs (e.g., [15, 21, 16, 20, 18]). The 
convex covering and packing problems are modeled by the following convex programs, where A is 
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an m X n matrix with non-negative entries and is its transpose: 


(Covering) 

(Packing) 


min fix) subiect to Ax > 1 
xgK!;: 


max 


Er.i y, - alA-^y) 


As remarked in the previous subsection, these two problems are Fenchel dual programs of each 
other when g is the convex conjugate of /, i.e., g = f* where /*(i') := max,^>o {(F,^ — /(-?)}• In 
particular, weak duality holds: for any feasible covering assignment x G such that Ax > 1, and 
for any feasible packing assignment y G M™, we have f{x) > J2jLi Vj ~ 

Our online primal dual algorithms simultaneously maintain feasible covering and packing as¬ 
signments, namely, x and y, such that (1) x and y are non-decreasing over time, and (2) at all times 
the covering objective f{x) is at most a times the packing objective Y^^=iVj ~ Qi-^v) some 
parameter a > 1. The first condition ensures that our algorithm is a feasible online algorithm for 
both the covering and packing problems. The second condition, together with weak duality, shows 
that our online algorithm is a-competitive for both the online convex covering problem with cost 
function /, and the online convex packing problem with cost function f*. 

While our framework gives online algorithms for both covering and packing simultaneously, we 
do design different algorithms for the two problems separately. This is because when the covering 
problem is of primary interest, we will exploit certain natural assumptions on the covering cost 
function /, such as monotone marginal cost (i.e., the gradient V/ is monotone), to get good 
competitive ratios. Similarly, when the packing problem if of primary interest, we will exploit 
assumptions such as Vg' = V/* being monotone to design and analyze our online algorithms. Note 
that such natural assumptions on / does not translate to natural assumptions on its conjugate 
g = f* in general. In particular, V/ being monotone does not imply that V/* is monotone, and 
vice versa.^ 

Covering: Our online convex covering algorithm is very natural. On the arrival of a covering 
request j, our algorithm continuously increases x until request j is satisfied, and for each resource i, 
it increases Xi exponentially at a rate proportional to the bang-per-buck ratio, i.e., Uji (fraction 
of request j we can cover per unit of resource i) divided by Vj/(x) (the partial derivative of / in 
the Tth coordinate). In other words, the change of Xi is proportional to -^-j^Xi. Our algorithm 
generalizes the multiplicative update process for the original online covering problem with linear 
objectives [14] and several other online optimization problems (e.g., [8]) to handle general convex 
objectives. See Section 3 for details of our algorithm. 

A novel technical ingredient of our approach is a new way to obtain competitive ratios that 
are independent of the entries of the constraint matrix A. Consider the original online covering 
problem with linear objective as an example. It is known that if we use the natural linear program 
relaxation and its dual, and we update both the covering and the packing assignments in a non¬ 
decreasing manner, then the competitive ratio is at least II(logn-|-log ^nia^) (e.g., [14]), where Umax 
and amin are the maximum and minimum non-zero entries of the constraint matrix A. Previous 
work such as Buchbinder and Naor [14] and Gupta and Nagarajan [19] remove the dependency on 
the entries of A by allowing the (dual) packing assignment to decrease. 

®For instance, the function f{xi,X 2 ) ’■= {xi + X 2 )^ and its gradient V/ are monotone on R+, but its convex 
2^ 2^ 

conjugate /*(zi, 22 ) = max{^, -^} is not differentiable on the line zi = 22 and V/* is not monotone. For example, 
when 22 is increased from (1,0) to (1,2), Vi/* decreases from Vi/*(1,0) = ^ to Vi/*(1,2) = 0. 
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We use a different approach. We first observe that the extra dependency on the entries of A 
is due to the linear part of the cost function—for any “snfficiently convex” cost fnnction /, our 
online convex covering algorithm gives a competitive ratio that is independent of the entries of A. 
Then, to obtain competitive ratios for any polynomial cost fnnctions /, we first approximate the 
cost function / with another snrrogate polynomial cost function / that is “snfficiently convex”, and 
then rnn onr online convex covering algorithm with respect to /. See Section 3 for details. 

We believe that the above technique, while simple, will find further applications in other online 
optimization problems. In particular, it suggests an interesting and unexpected way of nsing our 
online convex covering and packing framework, as well as work of others, on extending online 
primal dual to convex programs. Even for problem with natural linear program relaxations, it 
may be helpful to remodel the problem using convex programs becanse convexity may improve the 
competitive ratios of online primal dual algorithms. 

Packing: For the online convex packing problem, the (dual) covering assignment x can be inter¬ 
preted as unit prices of the resources, where Xi is the unit price of resource i. In the offline optimal 
solution, the Karush-Kuhn-Tucker conditions (e.g., [10]) indicate that the unit prices shall be equal 
to the marginal cost, i.e., x = Vg(A^y). In the online setting, however, the online algorithm does 
not know the final demand of resources A"^y, and thns, does not know the final marginal cost 
S/g{A^y). Instead, the online algorithm may predict the final demand given the current demand. 
Our online algorithm uses a particularly simple prediction rule: for some parameter p > 1, it pre¬ 
dicts the final demand to be p times the current demand and set x = \7g{pA'^y). This is a natnral 
generalization of the algorithms in [9] and [21], which considers only separable cost functions that 
can be written as g{z) = gi{zi) + ■ ■ ■ + gm{zm)- It turns out this simple approach is able to obtain 
asymptotically tight competitive ratios for polynomial cost functions. See Section 4 for details. 

1.3 Related Work 

This paper employs the online primal dual technique which, informally speaking, utilizes the struc¬ 
ture of the linear program relaxation of an optimization problem and its dual program to design and 
analyze online algorithms. The technique has led to optimal or nearly optimal online algorithms 
for a large family of problems, e.g., online covering and packing (with linear objectives) [14], online 
caching [ 6 ], online bipartite matching [17], etc. Readers are referred to [13] for a survey of results 
using the online primal dual technique. 

In particular, our work adds into the recent online of work on extending the online primal dual 
technique to problems related to convex programs, e.g., online schednling with speed-scaling [15, 19], 
online matching with concave retnrns [16], online combinatorial auction with convex production 
cost [21], etc. Before our work, the study of the online primal dual technique with convex programs 
has been on a problem by problem basis. Our online convex covering and packing framework can 
be viewed as a concrete step towards building a unified theory for using the online primal dual 
technique with convex programs. 

Next, we discuss a few related work that are most related to our results. Buchbinder and Naor 
[14] studied the online covering and packing problem with linear objectives, which are special cases 
of our more general problems. They proposed an 0(log n)-competitive online algorithm for online 
covering and an 0 (logn -|- log ^s^j-competitive online algorithm for online packing. 

Azar et al. [4] proposed the online mixed packing and covering problem as an attempt to unify 
several online optimization problems with both covering and packing constraints, and proposed the 
first algorithm for this general problem with competitive ratio 0 ( log (ilog(Z/ 37 )) where (3 and 7 
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are parameters that depend on the covering and packing constraint matrices of the problem. We 
obtain an improved competitive ratio of 0 (logdlog/) without the dependency on the parameters 
of constraint matrices. Due to a lower bound result of Azar et al. [5], our competitive ratio is tight. 

Blum et al. [9] proposed the study of online combinatorial auction with production cost, where 
the seller may produce multiple copies of each item subject to a convex cost function. In particular, 
for polynomial cost functions of degree r, Blum et al. [9] gave an 0(T)-competitive online algorithm. 
Later, Huang and Kim [21] improved the competitive ratios of Blum et al. [9] by constant factors 
and showed that the optimal competitive ratio can be characterized by a differential equation. 
Note that the cost functions in both Huang and Kim [21] and Blum et al. [9] are defined on a per 
item basis, which corresponds to the special case of separable cost function that can be written 
as g{z) = gi{zi) + • • • + gmi^m) in our model. In contrast, we give an 0(r)-competitive online 
algorithm that can handle arbitrary degree-r polynomial cost functions that may have correlation 
across different items. 

1.4 Concurrent Work 

In concurrent works, Azar et al. [5] and Buchbinder et al. [11] have also independently studied the 
same problems. Below we sketch the main differences between our work and those of the other two 
groups. 

Azar et al. [5] have considered the online convex covering problem, but not the packing problem 
explicitly. Their main result is essentially the same as our Theorem 1.2 for general cost functions. 
We stress that a major part of our effort in this paper is spent on improving the competitive ratio for 
general cost functions by focusing on the important special case of polynomial cost functions. We 
propose a simple online algorithm with nearly optimal competitive ratio for this case (Theorem 1.1). 

Buchbinder et al. [11] have studied both online convex covering and packing problems. Their 
competitive ratio for online convex covering has a rather involved dependency on the cost function 
/; it does not seem possible to directly compare our results with theirs. While both the algorithms 
of Buchbinder et al. [11] and ours are based on the online primal dual framework, there are many 
subtle differences. For example, the main algorithm of Buchbinder et al. [11] may decrease dual 
variables while our algorithm always maintain dual variables in a non-decreasing manner. 

It is instructive to compare the results for the online mixed packing and covering problem, an 
application of the online convex covering framework which all three groups consider. Azar et al. [5] 
obtained an 0(log Hog d/lqj-competitive algorithm, where (5 and 7 are the maximum-to-minimum 
ratio of the non-zero entries of Uj’s and c^’s respectively, and both a^’s (covering constraint vectors) 
and Cfc’s (packing constraint vectors) must be d-sparse in the sense that each vector may have at 
most d non-zero entries. Buchbinder et al. [11] obtained an 0(log / log d)-competitive algorithm, 
removing the logarithmic dependency on /3 and 7 , and only the covering constraint vectors need to 
be d-sparse. In this paper, we propose an 0(log Z log d)-competitive algorithm, also removing the 
logarithmic dependency on fd and 7 , and only the packing constraint vectors need to be d-sparse. 

For online convex packing, Buchbinder et al. [11] exploited the fact that it is the dual problem 
of online convex covering and directly use one of their online convex covering algorithms. As a 
result, their algorithms and competitive ratios are parameterized by the convex conjugate of the 
cost function. For some specific cost functions, they managed to reformulate the problem and get 
around the problem of having non-monotone gradient of the convex conjugate. Readers are referred 
to their paper for details. 

In contrast, we design simple online algorithms that are specialized for the online convex pack- 
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ing problem and the online combinatorial auction problem, and obtain asymptotically optimal 
competitive ratio for general convex polynomial cost functions (Theorem 1.3). 

2 Preliminaries 

For a positive integer re, we denote [re] := {1,2,..., re}. We use x to denote a column vector. For 
two vectors a and b of the same dimension, we write o > 6 if each coordinate of a is at least the 
corresponding coordinate of b. We use (a, b) to denote the dot product of a and b. Let 0 and 1 
denote the all zero’s and all one’s vectors, respectively. A function g : M!} —)• is monotone, if 
a <b implies that g{a) < g{b). For i £ [re], we use e* to denote the unit vector whose ith coordinate 
is 1. 

Conjugate 

Given a function / : ^ R_|_, its conjugate f* : —)■ M+ is defined as: 

f*{z) = max - f{x)} 

T 

For example, if f{x) = is a degree-r polynomial (r > 1), then f*{z) = (1 — is a 

degree-:;:^ polynomial. 

Note that (x, z) — /(x) can be interpreted as the negative of the y-intercept of a hyperplane that 
passes through point x and has gradient 2 ;. Further, the negative of the y-intercept is maximized 
when the hyperplane is a tangent hyperplane of function /. Hence, f*{z) can be interpreted as the 
y-intercept of the tangent hyperplane of / that has gradient z. 

For the rest of this paper, we will focus on functions / that are non-negative, monotone and 
differentiable, and /(O) = 0. In this case, the conjugate satisfies the following properties: 

• The conjugate /* is non-negative, monotone, convex, and /*(0) = 0. 

• If / is lower semi-continuous, /** = /. 

Readers are referred to the textbook of Boyd and Vandenberghe [10, Chapter 3.3] for detailed 
discussions on conjugate. 

Online Convex Covering and Packing Problems 

Convex Covering: We will first introduce the offline convex covering problem. Let there be re 
resources. Producing x* units of resource i for each i £ [re] incurs a production cost of /(x), where 
/ : M” —^ M_|_ is convex, non-negative, monotone, and differentiable, and /(O) = 0. Let there be m 
covering requests: > 1 for j £ [m], where Ojj’s are non-negative. The objective is to 

minimize the total production cost /(x) while satisfying all covering requests. 

Let dj denote the non-negative vector (oji,aj 2 ) ■ ■ ■ jOjn) and A = (ojj) denote the mxn matrix 
whose rows are Oj’s. The offline convex covering problem can be formulated as a convex program 

mm C{x) := /(x) subject to Ax > 1 (2.1) 

In the online convex covering problem, the covering requests arrive one by one. In round 
k £ [rre], covering request k arrives with the vector dk £ R” (we use j as a generic index of 
covering constraints and k as the index of the current request), and the covering player must 
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decide immediately how to increase some of the Xj’s to satisfy the request X]iG[n] ^kiXi > 1 without 
knowing future requests. Note that the algorithm cannot decrease the values of Xj’s throughout 
the process. We allow the covering requests to be chosen adversarially depending on the past 
response of the algorithm. The goal is to approximately minimize the covering objective C{x) 
while satisfying all covering requests. 

Convex Packing: We start with the offline problem. Let there be n resources. Producing 
Zi units of resource i for each i S [n] incurs a production cost of g{z), where g : R” —)• R+ is convex, 
non-negative, monotone, and differentiable, and g{0) = 0. Let there be m packing requests, where 
each request j G [m] is specified by a non-negative vector Sj = (aji,aj 2 , • • • ,ajn) such that serving 
one unit of packing request j consumes aji units of resource i. The objective is to maximize the 
total number of units of packing requests served minus the total production cost. 

Recall that A = {aji) denotes the m x n matrix whose rows are Qj’s. Let yj denote the number 
of units of packing request j that the algorithm decides to serve. Then, the total amount of 
resource i consumed is Zi = offline convex packing problem can be formulated 

as the following convex program. 

max P{y)-=Ejelm]yj-9{A'^y) (2-2) 

In the online problem, the packing requests arrive one by one. In round k G [m], packing request 
k arrives with the vector Sk, and the packing player must irrevocably pick a value yk > 0. We can 
also view that there is some vector y that is initially set to 0, and in round k, the online algorithm 
can only increase the A:-th coordinate of y. The goal is to approximately maximize the packing 
objective P{y). 

Convex Covering/Packing Duality: If the cost function of the convex packing problem is the 
conjugate of the cost function of the convex covering problem, namely, g{z) = f*{z), the convex 
packing program (2.2) is the Fenchel dual program of the convex covering program (2.1). The 
reader is referred to [10] for details on Fenchel duality. 

When we have explicit assumptions on /, we emphasize the performance of the covering player, 
and let the packing player play an auxiliary role. On the other hand, when we have explicit 
assumptions on the conjugate /*, the roles are reversed, and we put emphasize on the performance 
of the packing player. 

Lemma 2.1 (Weak Duality) For any x G R” such that Ax > 1 and any y G R™, we have that 

C{x) > P{y), 

which holds even if f is not convex. 

In the rest of this paper, we will use f* to denote the cost function in the convex packing 
problem, and the corresponding convex program becomes 

max P{y) ■■= EiGH Vj " r{A^y) (2-3) 

Competitive Ratio: We will compare the expectation of the objective value achieved by an 
online algorithm with the offline optimum. Let C°P and P°P denote the offline optimum of the 
convex covering program and convex packing program respectively. We use the convention that 
the competitive ratio is at least 1. Hence, for an online covering algorithm that returns x, its ratio 
is for an online packing algorithm that returns y, its ratio is 



Online Primal Dual Framework 

Readers are referred to, e.g., the survey by Buchbinder and Naor [14], for a comprehensive dis¬ 
cussion of the online primal dual framework and its applications for problems with linear program 
relaxations, and, e.g., Devanur and Jain [16], Devanur and Huang [15], etc., for its recent extensions 
to convex programs. Below we briefly describe how the framework works for our problems. 

Online Primal Dual Algorithm: Our online primal dual algorithms maintain both a feasible 
covering assignment x and a feasible packing assignment y G online. More precisely, we can 
view an online primal dual algorithm as running an online covering algorithm and an online packing 
algorithm simultaneously as follows: 

• Covering Algorithm. A vector x G M” is initially set to 0. In round k G [m], the covering 

player can increase some coordinates of x such that the covering constraint HkiXi = 

{ak,x) > 1 is satisfied. The goal is to minimize C{x) := f{x) at the end of the process while 
satisfying all covering constraints. 

• Packing Algorithm. In round k G [m], the packing player irrevocably picks a value yk > 0. 
We can view that there is some vector y that is initially set to 0, and in round k, the packing 
player can only increase the fc-th coordinate of y. The goal is to maximize P{y)- 

We note that there are online primal dual covering algorithms in literature that may decrease 
yj^s. For example, [13] showed that decreasing yj's is crucial for obtaining competitive ratio that is 
independent of the parameters of the constraint matrix A for the online covering with linear cost if 
we use the natural linear program relaxation. In contrast, we obtain a competitive ratio independent 
of the parameters of the constraint matrix by considering an alternative convex program formulation 
the problem. See Sections 3 and 4 for details. 

Online Primal Dual Analysis: As the covering and the packing algorithms maintain their 
corresponding feasible solutions x and y such that after every round, we seek to preserve an invariant 
C{x) < a - P{y), where a is a parameter that can depend on the function / (and its conjugate /*) 
and n, but is independent on the number of requests m. 

Because of weak duality C{x) > C°P > P°P > P{y), it follows that C{x) < a ■ P{y) < a ■ (7°^*, 
and P°P^ < C{x) < a ■ P{y)- Hence, the online primal dual algorithm described above achieve a 
competitive ratio at most ol for both the online covering and packing problems. 

3 Convex Online Covering 

Recall that the covering and the packing problems are dual of each other. In this section, we will 
treat the convex covering problem as primal and the convex packing problem as dual. 

(Primal) mm C{x) := f{x) subject to Ax>l (3.1) 

(Dual) max P{y) := EjgM Vj “ (3-2) 

Recall that for notational convenience, we denote z := Ai^y. We make the following assumptions 
on the cost function / in the covering (primal) problem throughout this section. The hrst set of 
assumptions are regularity assumptions that we have discussed in the preliminary. 

Assumption 3.1 The function f is convex, differentiable, and monotone, and /(()) = 0. 
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Beyond the above regularity assumptions, we will further assume that the marginal cost is 
monotone. Recall that the function / models the production cost of resources. This assumption 
is saying that producing more copies of some resource i does not decrease the production cost of 
resource i' (regardless of whether i' = i ov not). In Section 3.2.3, we will have an even stronger 
monotone condition V/ to get a better competitive ratio. 

Assumption 3.2 The gradient Vf is monotone. 

Finally, we introduce a parameter r > 1 that measures how convex the function / is. 

Assumption 3.3 There is a parameter r > 1 such that for all x G R”, (V/(x),x) < r • f{x). 

For instance, if / is a multivariate polynomial with non-negative coefficients, r can be chosen as 
the maximum degree of /. In particular. Assumption 3.3 implies the following inequalities which 
will be useful in our competitive analysis. We remark that the convexity of / is only explicitly used 
to prove part (c) of Lemma 3.1. 

Lemma 3.1 (Convexity of /) Suppose that for some t > 1, for all x G R”, (V/(x),x) < r-/(x). 
Then, the following statements hold. 

(a) For d > 1, for x G R+, f{Sx) < S^f{x). 

(b) For 0 < 7 < 1, for z £ R”, /^(yT) < 7 ^^ • f*{z). 

(c) Suppose further that f is convex. Then, for all x G R+, 
r (V/(f)) = (f, V/(x)) - fix) < (r - 1) • fix). 

In particular, statements (b) and (c) implies thatforO < 7 < 1, /*( 7 -V/(x)) < 7^-1 -(r —1)-/(T). 

Proof: For statement (a), define the function g : [l,+oo) —)• R by giO) := ln/(0x). Then, 

g'iO) = where the last inequality follows because ifi9x),9x) < rfidx). Integrating 

over 9 G [1,5] gives gi5) — gil) < Tln5, which is equivalent to /(5x) < 6'^fix). 

, 1 

For statement (b), for 0 < 7 < 1, let 5 := > 1, we have 

f*i'yz) = max{(x, 7 ^) — fix)} = 7 ^^ max{(5T, — 5'^ fix)} 

xGR^ xGR^ 

< 7 "^ max{(5T, —/(5x)} = 7 "^/*(T) , 

xGR^ 

where the inequality follows from statement (a). 

For statement (c), observe that /*(V/(x)) = max^gR^ hiw), where /i(uJ) := (u;, V fix)) — fiw). 
Hence, V/i(u;) = V fix) —V fiw). Since / is a convex function, h is a concave function and V/i(x) = 
0; it follows that h is maximized when w = x. Therefore, we have /*(V/(T)) = ix,^ fix)) — fix) < 
(r — 1 ) • fix), where the last inequality follows from the assumption on /. ■ 
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3.1 Online Covering Algorithm 

The details are described below in Algorithm 1. 


Offline Input: function / : M 

Initialize: x := L and z := 0 ; 

while constraint vector Sk = (afci, • • • > Ofcn) arrives in round k do 
Set 2 /fc := 0 ; 

while Yh=i o-kiXi < 1 do 
Continuously increase yk'- 

Simultaneously for each i € [n], increase Xi at rate ^ ? and 

dz- 

increase Zi at rate ^ = a^j. 

end 

end 

Algorithm 1: Convex online covering 

The algorithm takes the objective function / of the covering problem as input, but as we shall 
later see, sometimes the algorithm is run on a modified version of / that might not even be convex. 
The vector x in the covering problem is the primal variable, and is initialized to some appropriate 
L e M". In each round k G [m], a new constraint vector = {aki,ak 2 , ■ ■ ■ ,CLkn) is presented to 
the algorithm. The algorithm increases Xj’s continuously in the following manner until the new 
constraint a^i > 1 is satisfied: 

Imagine the dual variable yk as time. As yk increases, the algorithm increases variable Xi with 
rate ^ simultaneously for all i G [re], where Vif{x) = is the partial derivative of / 

with respect to Xj, and p > 0 is some parameter to be determined. (We assume that the XiS are 
increased as stated, without worrying about how the differential equations are solved.) In order to 
maintain the auxiliary vector z = A'^y, we initialize z to 0, and in round k G [rre], for each z G [re], 
we also increase Zi at rate = Oki while yk increases. 

To show feasibility of the algorithm, we need to prove that all covering constraints are eventually 
satished. (Note that dual feasibility is trivial given our rule of updating y and z.) We will initialize 
X appropriately to ensure that at least one of XiS with Oki > 0 will increase as yk increases and 
eventually Ylie[n]^ki reaches I, at which moment the new constraint is satished and yk stops 
increasing. In Section 3.3, we shall show in Lemma 3.10 that if x is initialized such that each 
coordinate is non-zero, then the covering constraint will eventually be satished. Lemma 3.11 shall 
discuss the case when x is initialized to (l with additional assumptions on /. 

3.2 Competitive Analysis 

We will show two positive results regarding our online convex covering algorithm. The hrst result 
holds for general cost functions that are at most as convex as degree-r polynomials in the sense that 
(V/(x),x) < t/(x) for all x G M”. We prove that our algorithm is competitive with an additive 
loss of r • C{L), where L is the initial value of x. The precise statement is presented below. 

Theorem 3.1 Recall that L is the initial value ofx. Suppose that x < U throughout the algorithm.^ 
Then, letting p = maxjgj^] ^ > 1, and p = W“^-(lnp)'^, after the last round m, Algorithm 1 returns 

a primal vector such that C{x^'^^) < (rlnp)"^ • (7°^* -|- r • C'(L). 

^For instance, Xi < i- for any i G [nl. So Ui = i- is a feasible upper bound. 
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Our second positive result focuses on polynomial cost functions, an important special case in 
the literature as almost all known theoretical studies on related topics consider polynomial cost 
functions. The precise statement is presented below. 

Theorem 3.2 (Polynomial with Max Degree r) Suppose f : M" — >■ K is a convex multivari¬ 
ate polynomial with non-negative eoeffieients and maximum degree r. Then, we have the following. 

(a) Suppose the eost funetion f is a homogeneous polynomial, i.e, only monomials with degree 
exactly r can have non-zero eoeffieients. Then, there is an O {t log nY-competitive online 
covering algorithm. 

(b) Suppose that f can he expressed as a sum of N monomials. Then, there is an 0{T\ogNY- 
competitive online covering algorithm. 

Azar et al. [5] proved a lower bound of fl(rlogn)’^ (for sufficiently large n) for the special case 
when f{x) is the r-th power of the .^.^-norm of a set of linear cost functions, which is a homogeneous 
polynomial with degree r. Hence, the competitive ratio of our online algorithm is nearly optimal. 

Outline of Competitive Analysis. After every round, we wish to give an upper bound of C{x) 
in terms of P{y) = Yhj Vj ~ /*(-?)) where z = A^y. Since P{y) is the difference between the two 
terms yj and f*{z), our analysis will handle them separately: 

• In every round k G [m], we will compare the increase in f{x) with the increase in y^ (from 
0) and show that the formal is upper bounded by the latter up to a factor of p (Lemma 3.2). 
The analysis of this part is identical for both results. It only requires that at each round, the 
algorithm can increase x according to the rules to satisfy the online covering constraint. 

• At the end of the process, we will give a coordinate-wise upper bound for z. Since /* is monotone, 
this gives a lower bound for P{y). We will exploit additional properties of / that lead to a better 
upper bound to prove Theorem 3.2 for this part. 


3.2.1 Comparing Increases in f{x) and y^ in Each Round 


Lemma 3.2 For k G [m], let denote the primal vector at the end of round k, where x^^^ = L 
is the initial vector. Then, for eaeh k G [m], at the end of round k, we have 

yk>y ifix^''^) - f{x^^~^'>)) ■ 

In partieular, summing this inequality over all rounds k G [m], at the end of the process we have 

EfceH Vk>\- - f{L)) . 


Proof: Note that in round k G [m], the algorithm increases y^ only when Y17=i^kiXi < 1. 

Therefore, while yk is increased, we have: 


n 1 ^ 

1 > ^ akiXi = - ^ Vi/(f) 

i=i ^ i=i 


dxi 

dyY 


(3.3) 


where the equality comes from ^ 

As yk increases from 0 until the end of round k, x increases from to Hence, 

integrating both sides of (3.3) with respect to yk over the change during round k, we have: 


1 

yk> - 
p 



1 


s(fc) 


P Jx=x^^ 


(V/(f),df) 
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where the last equality comes from the fundamental theorem of calculus for path integrals of vector 
helds. ■ 


3.2.2 Proof of Theorem 3.1 

We will first show an upper bound for z = in terms of V/. 

Lemma 3.3 After the last round m, we have • V/(x^™'^). 

Proof: In each round k € [m], as yk increases, both Xi and Zi increases. Specihcally, 
dzi _ dzi dyu _ ^ifix) _ Vif{x) 

dxi dyk dxi pakiXi pxi pxi 

where the last inequality follows from the monotonicity of Vj/(-). Observe that as 2 increases from 
0 to = {z^\z^\ ... ,zi"^^) at the end of round m, each Xi increases from Lj to < Ui. 
Hence, integrating ^ —- with respect to Xi throughout the whole process, we have 


J"i) 


In ■ 


(m) 


In^ 


< 




as required. ■ 

Proof: [Theorem 3.1] Our choice oi p = ■ (In/r)'^ satisfies < 1. Hence, by the above 

lemma, the monotonicity of /*, and the conclusion of Lemma 3.1, we have 




)<r(^ 


•v/(f)) < 


1)-C{x^^'>) . 


Further note that from Lemma 3.2, at the end of round m, the dual vector y satishes 

yk>\-{C{x^^'^)-C{L)) . 

fcg[m] 


Hence, by p = r'^ ^ • (In pY, we have 

P(y) =Y.y>‘- f 2 - 1 )) ■ ^ 

k£[m] 

which implies the result after rearranging, since P{y) < P°p* < C°P. 

3.2.3 Proof of Theorem 3.2 


r-C(L) 


In this subsection, we consider the case when / is a multivariate polynomial with non-negative 
coefficients. If / has maximum degree r, then for all x G M”, (V/(x),x) < r • f{x). 

We will first show how to obtain better competitive ratio if the gradient V/ satisfies a stronger 
form of monotonicity. Next, we will explain how to approximate any polynomial cost function / with 
a surrogate cost function / such that its gradient V/ satisfies the stronger form of monotonicity. 
Finally, we obtain the competitive ratio in Theorem 3.2 by running our online covering algorithm 
with respect to the surrogate cost function /. 
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Better Competitive Ratio from Stronger Monotonicity of V/. For A > 0, we say that 
V/ is X-monotone, if for each i S [n], the function x is monotone. For example, if each Xi 

appears with degree at least (A + 1) in /, then Vf is A-monotone. With this assumption, we prove 
a different version of Lemma 3.3 

Lemma 3.4 Suppose that V/ is X-monotone for some A > 0. Then, after the last round m of 
Algorithm 1 running with f, we have ^ This is true even if f is not convex. 

Proof: Similar to the proof of Lemma 3.3, we give an upper bound on the vector z = A^y after 
the end of round m, which we denote by . Recall that we denote the primal vector after round 
m by For each is [n], 

^ J_ . Vi/(x) dx^ ^ dx^ 

dxi dyk dxi pCLkiXi pX dxi — pX x^^ dxi ’ 

where the last inequality follows from the A-monotonicity of V/ and x < xA ), 

Observe that as Zi increases from 0 to xi increases from 0 to x^^\ Hence, integrating with 
respect to Xi throughout the algorithm, we have for each i, z^^^ < ^ • Vif{AAf g 

Lemma 3.5 Suppose that f is convex, and for all x S M”, (V/(x),F) < r • f{x). Moreover, V/ 

7 ” — 1 

is X-monotone for some 0 < A < r. Then, Algorithm 1 with p := and initial primal vector 
L := 0 is {jY-competitive. 

Proof: We choose p := such that 7 ■= -^ = < 1. By Lemma 3.4, we have 

Y-A < y . V/(x*'™'^). Since / is convex, we can apply Lemma 3.1. Observing that f* is monotone, 
we have 

< (r - 1) . 7 ^ . f{xA)) = (r - 1) • (A)" . C{xA)) . 

On the other hand. Lemma 3.2 states that 

Eke[m] yk>^p- ifixA)) - f{L)) = ^ • C(xM) , 

since f{L) = /(O) = 0. 

Therefore, putting things together, we have 

m = EfceH Vm - ^ • CCfM) - (r - 1) • (A)- • C(xM) = {^Y • C{xA)) . 

Recall that P{y) < P°p* < C°^^, we have . (7°^*, as required. ■ 

Homogeneous Polynomial / with Degree r. Next, we consider a degree-r homogeneous 
polynomial / with non-negative coefficients. Note that any such polynomial satisfies our assump¬ 
tions on the convexity of /, namely, for all x G M”, (Vf{x),x) < t ■ f{x)- However, V/ could be 
only 0-monotone in general, in which case Lemma 3.5 cannot be applied. 

The idea is to approximate / with another function / such that V/ is A-monotone for some 
A > 0. We will start with some notations for describing polynomials. We use the vector d = 
{di,d 2 ,... ,dn) G Z”’ to describe the degrees of the xfs in a monomial. For a monomial with 
maximum degree r, we have di <t (where equality holds if the polynomial is homogeneous). 

Observe that there are such vectors d, for n > 4. We denote x‘^ := Uie[n]xt- Given a 

vector a with non-zero coordiniates, we denote the vector § := (f^, • • • > 

’ a ^ ai ■ a2 ' ’ otn' 
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Lemma 3.6 For any homogeneous convex polynomial function f with non-negative coefficients and 
degree t, if f has a monomial with positive coefficient that involves Xi, then the term xf must also 
have positive coefficient. 


Proof: Given / : M, we fix some i G [n] and define g : —)• R as g{x, y) := f{xei+y{l—ei)), 

i.e., we apply / to the vector whose f-th coordinate is x and all other coordinates are y. Observe 
that since / is convex, so is 5 . 


Suppose for contradiction’s sake, / depends on Xj, but the coefficient of xf is zero. Then, since 
/ is homogeneous with degree r, it follows that g can be expressed as g{x,y) = ^ 

where 1 < d < r and bd > 0 . 


We next consider the Hessian matrix H{g) 


- d^g a^g - 

dx'^ dxdy 

a^g a'^g 

_ayax dy'^ _ 


Because g is convex, the matrix H{g) is positive semi-definite, and hence the determinant \H{g)\ 
is non-negative. Observe that \H{g)\ is a polynomial in x and y. We shall reach a contradiction by 
showing that \H{g)\ is negative for some x and y. 


For the case d = 1, observe that = 0, and 
Hence, the determinant is negative in this case. 


afg 

dyax 


> 0 when x and y are both positive. 


For the case 2 < d < r, we fix some y > 0 and consider large x > 0. Since d < t, the monomial 
in \H{g)\ with largest degree in x is whose coefficient is 


bd ■ {d(d — 1 ) • (r — d)(T — d — 1 ) — d^(r — d)^} < 0 . 


Therefore, for sufficiently large x, the determinant \H{g)\ is dominated by the monomial 
^ 2 {d-i)y 2 (T-d-i) ^ hence is negative, reaching the desired contradiction. ■ 

Approximating /. Suppose /(x) = • x'^. In view of Lemma 3.6, we can assume, without 

loss of generality, that / depends on every x,. For each i G [n], let Cj > 0 be the coefficient of x[ in 

/. Let the vector a G R” be such that Oj := c[. 

For A > 0, define / : R” —s- R by /(x) := J2d ^d' 

Claim 3.1 The function f satisfies the following. 

1. V/ is X-monotone; 

(y/(x), x) < r(l -h A) • /(x); 

3. f is convex. 

Proof: The first statement follows because whenever Xj appears, its degree is at least 1 -|- A. The 
second statement follows because each monomial has degree at most (1 -|- A)r. 

We next prove that / is convex. Observe that /(x) = f{(f{x)), where (f : R!J: —)■ R!J: is defined 
by ^p{x)i := af ■ x.+^. 

Since the mapping t e-)- t^~^^ is convex for A > 0, it follows that for p,q > 0 such that p + q = 1, 
ifipx + qy) < p ■ ip{x) -\- q ■ (p{y), where the inequality performs coordinate-wise comparisons. 

Since / is monotone, we have f{px qy) < f{p ■ <p{x) + q ■ ‘p{y)) < p ■ /(x) -|- q ■ f{y), where the 
last inequality follows from the convexity of /. ■ 
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Lemma 3.7 (Approximation by /.) For 0 < A < 1, suppose f is defined as above. Then, we 
have 2 ^ • / < • /. 

Proof: Define g : M” —>■ M by g{x) := /(§) = x‘^, where 6 ^= c^- (g)'^. Since g is obtained 

from / by scaling each x,, the convexity of g follows from that of /. In particular, observe that 

for each i £ [n], the coefficient of xj" in g' is = Ci ■ = 1. Similarly, define g : —)• M by 

g{x) := /(§) = Therefore, it suffices to prove the analogous inequality for g and g. 

We define Z := Observe that Z = ^(1) < b J2ie[n] i where the inequality 

follows from the convexity of g, and the equality follows because g is homogeneous and the coefficient 
of each xl is 1 . 

Using Holder’s inequality {Y^iSiU < ' {Y.YIY with | = and | = 3 ^), we have 

g < ■ 5^+’’' • Thus, we have the upperbound <7^"’’^ < Z^ ■ g < ■ g. 

Define 6 := and let A := {d: > 9Z'\ index the coefficients that are at least the threshold 

9Z < 1. Therefore, all the terms xl have coefficients 1 and pass the threshold. 

Let A be the complement of A. Observe that there are at most n’’’ terms, and for each d, 
-(i+A)d < Hence, 

^ b^- <n^ -dZ-Y^ xY^'>^ . 

d&A *G[n] deA 

Thus, we get that 

g{xY^^ > Y ^ E ^d- ^ ^ • 9{x) > ^ • g{x) , 

d deA 


as required. ■ 

Putting together, we can use Lemma 3.5 to run Algorithm 1 on / to obtain an approximation. 
Proof: [Theorem 3.2(a)] By Lemma 3.5, when Algorithm 1 is run for the function / with 
P •= • ( '^T+D )'^(^+A y ._ the returned primal vector approximates /T^ 

with competitive ratio — 0{TlognY■ 

^ 1 

Furthermore, by Lemma 3.7, / and /!+•’' approximate each other with a multiplicative factor 

1 3 At , , 

of 2i+^ • 77 - 1 +-^ = 0(1)'’’. Therefore, it follows that the primal vector x'^™'' approximates / with 
competitive ratio 0 (t log n)'’’. ■ 

Corollary 3.1 (£p-norm of linear functions) Suppose there are I vectors c4 G (h € [1]) 
such that each Ck has at most d non-zero coordinates. Define A : —>• such that for A; G [/], 

Afc(x) := (cfc,x). For some p > 2, define for k G [1], fk{x) '■= Xk{xY- Define the cost function 
/(x) := X^fcg[;]/fc(T)- Then, there is an 0{p\ogdY-competitive online covering algorithm with 

respect to f. Equivalently, the competitive ratio is 0{plogd) with respect to the £p-norm ||A(x)||p. 

Proof: For each k G [^], we consider fk and fk as defined in Lemma 3.7. Recall that for each 
k G [^], there is some transformation (fk ■ 1^+ of the form (fk{x)i := aX ■ xj’’’"^ such that 

fkix) = fkiPkix)). Here, we will eventually set A := j^. 
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For each A: G [Z], define := , which has the form hk{x) = {Ck, xd+'’^)r^TTA , One can check 

readily that (Vhkix), x) = p ■ hk{x) and Vhk is A-monotone, since p > 2 and 0 < A < 1. Hence, 
the function h := hk also satisfies these properties. 

By Lemma 3.5, when Algorithm 1 is run for function h, the competitive ratio is with 
respect to h. 

1 1 

Finally, we observe that for each A: G [/], Lemma 3.7 states that (• hfc < /a, < di+-^ • hk- 

3Xp 

Hence, it follows that / and h approximate each other with a multiplicative factor of 0{d^+^) = 
Oiiy, for A := 

Hence, it follows that the competitive ratio is O(plogd)^ with respect to /, as required. ■ 

General polynomials / with Maximum Degree r. For a general convex multivariate poly¬ 
nomial function / = with non-negative coefficients and maximum degree, the previous 

method does not work if / is non-homogeneous. For some 0 < A < 1, we use another approxima¬ 
tion / = However, / might not be convex. Observe that convexity is used crucially 

in Lemma 3.1(c). We tackle this issue by comparing / with another convex function g := 

Let N be the number of monomials in / with positive coefficients. We denote r := (1 -|- A)r. 

Lemma 3.8 (Comparing / and g) Suppose f and g are as defined above. Then, we have: 

1. f is differentiable, monotone, and /(O) = 0; V/ is X-monotone; f* is monotone; 

2. (V7(x),f) <T- f{x); 

3. f^g<N^-f; 

4- V/ < Vg; 

5. for allzG f*{z) < ^ • g*{N^ • z). 

Proof: The first two statements can be verified directly. In the third statement, the lower bound 
is straightforward, and the upper bound follows from Holder’s inequality. 

We next verify the fourth statement. We shall use the inequality that for each d, f{x) > c^- x^ 
in the following argument. For each i G [n], 

VjC/(f) = {1 + X) ■ f{x)^ ■ X7if{x) 


(l + A)-/(f)". 


di ■ X 




d\di>l 


> 


. (1 + A)di • 
d 


E 

d\di>l 

= vj(f), 

as required. 

For the fifth statement, we have 

f*{z) = max {(x,^-7(f)} < max{(f,i)-^[^u} = -^max{{x,N^z) - g{x)} = -^■g*{N^z), 


where the inequality follows from / > ^ 


Even though / might not be convex, we can still apply Algorithm 1 on it. 
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Lemma 3.9 If we apply Algorithm 1 on f, then at the end of round m, we have: 

~ IV^ T ~ 

where p is chosen such that ^ < 1. 

Proof: Observe that Lemma 3.4 does not require convexity, and hence we still have 

^ v7(fM) ^ Vff(fM) 

^ - pX - pX ■ 


Since / might not be convex, we compare / with g, and apply Lemma 3.1 to s' in the following. 

'AtA 




1 


N^ \ 




N^ \ 


where ^ < 1 is needed for Lemma 3.1. ■ 

pA — 

Proof: [Theorem 3.2(b)] When we apply Algorithm 1 to /, from Lemma 3.2, at the end we have 
yj — p ■ Combining with Lemma 3.9, we have: 


m = E ^ 

je[m] 


- • 

p 





We pick p := {^Y ■ W h Then, ^ ^ < 1, and we have P{y) > {^Y ■ 

We pick A := . Hence, the vector is 0(r log A^)'^-competitive with respect to /~. 

~ 1 

Finally, observing that / and /i+-^ approximate each other with a multiplicative factor of 

A _j , 

A/'i+^ = 0(1), we conclude that x'-”*-' is also 0(r log A^)’^-competitive with respect to /. ■ 

3.3 Primal Feasibility 

Observe that in Algorithm 1 , in round k € [m], each Xi is increased according to the rule = YYpS) ' 
So far we have assumed that if Oki > 0, then Xi will be increased as yk increases to makes sure the 
covering constraint can be satisfied (i.e., > 1). We now resolve some technical issues 

concerning the increase of x. 

• What happens if Vif{x) = 0? Indeed, if / is a polynomial such that every xi occurs with degree 
at least 2, then Vi/(()) = 0. Suppose i G [n] is such that Oki > 0 and Vi/(x) = 0. This means 
that currently we can increase Xi such that the rate of change in / is 0. If there exists some 
e > 0 such that for all t G [0, e], Vif{x + te)) = 0, then we can increase Xi by e without changing 
the value of /. Therefore, we can increase Xi for each such i, and assume that for all i G [n] such 
that Oki > 0, for all e > 0, Vif{x + ee)) > 0. This assumption is used in Lemma 3.11. 
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If X is initialized to 0, then x = 0 is a solution to ■ One way to resolve this is to 

initialize x := L such that each coordinate has some small non-zero value to give each Xi some 
non-zero momentum to increase. However, this introduce an extra additive error term C{L) as 
in Theorem 3.1. We formally explain this approach in Lemma 3.10. 


In order to keep a multiplicative competitive ratio while initializing x = 0 as in Theorem 3.2, 
we show in Lemma 3.11 that our A-monotone condition on V/ ensures that Xi can be increased 
strictly as Uk increases. This is analogous to the situation that the differential equation ^ = -y/x 
with initial condition x(0) = 0 has a trivial zero solution, but what we need is the existence of 
a non-trivial solution x{t) = 


Lemma 3.10 will stop increasing—general case) Consider round k G [m] and suppose 
for some i G [n], a^i > 0. Suppose further that at the beginning of round k, the i-th coordinate Xj 
takes some non-zero value Li. Then, eventually the constraint (ofc,x) > 1 will be satisfied, and 
will stop increasing. 


Proof: Observe that for every i G [n] such that > 0, round k finishes as soon as Xj gets at 
least Hence, we have an upper bound U for the vector x throughout round k. In particular, 

^ki 

since V/ is monotone, we have V/(x) < Vf{U). 

Therefore, for each i G [n] such that a^i > 0 and Xj is initialized to some Lj > 0, we have 
^ , which is some positive constant independent of Vk- It follows that if Vk 

keeps on increasing, eventually Xj will reach at least and Vk will stop increasing. ■ 

^ki 

Lemma 3.11 (y^ will stop increasing—A-monotone case) Consider round k G [m] and sup¬ 
pose for some i G [n], Uki > 0. Suppose further that at the beginning of round k, the primal vector 
takes value x^^^ such that for all e > 0, Vif{x^^^ -C eci) > 0. Then, eventually the constraint 
{dk,x) > 1 will be satisfied, and yk will stop increasing. 


Proof: Observe that for i G [n], Xj will be increased to at most if Xj is already at least 
at the beginning of round k, then round k is finished immediately. Hence, we can obtain an upper 
bound U for the vector x throughout round k. 

We have ^ • Since Vj/(x) > 0 when the iih. coordinate is increased, we can write 


P^ki 


Xi dyk — Uf ' i ' dyk ' 


where the inequality holds because V/ is A-monotone. 

Integrating with respect to yk as y^ increases from 0 to t, we have pokit < ■ ixi{t)^ — XqJ. 

Hence, Xj(t) > (• t + x^f)^ increases strictly as a function of t. 

Moreover, observing that the last expression tends to -|-oo as t tends to -|-oo, we can conclude 
that eventually the covering constraint XlieH ^kiXi > 1 will be satisfied. ■ 


4 Convex Online Packing 


For the convex online packing problem, we will make explicit assumptions on the packing cost 
function f* and exploit them to design and analyze our online algorithms. Hence, even though we 
use the same notation, it is more natural to treat y as the primal vector and x as the dual vector. 
Again, we start with some regularity assumptions that we have discussed in the preliminary. 
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Assumption 4.1 The function f* is convex, differentiable, and monotone, and f*{0) = 0. 

Similar to the covering case, we will further assume that the marginal cost is monotone. Note 
that the marginal packing cost function, i.e., V/*, being monotone is not the same as that the 
marginal covering cost function, i.e., V/, being monotone. 

Assumption 4.2 The gradient V/* is monotone. 

Note that there are instances for which the offline convex packing problem whose objective can 
be arbitrarily large. Consider, for example, a convex packing problem with a linear cost function 
f*{z} = ^ the first request is ai = 1. Then, by letting yi to be arbitrarily large, we 

can get a feasible packing assignment with arbitrarily large objective. Obviously, such instances are 
not interesting for practical purposes. Hence, we will assume that the offline optimal is bounded. 

Assumption 4.3 The offline eonvex packing problem has bounded optimal objeetive. 

Our main result of for the online convex packing problem is an online algorithm with the optimal 
competitive ratio for polynomial cost functions. 

Theorem 4.1 (Polynomial Cost Functions) Suppose the packing cost function f* is a convex 
polynomial with non-negative eoefficients, zero constant term, and maximum degree r. Then, there 
is an 0 {t)- competitive online algorithm for the online convex packing problem. 

Due to a lower bound result of Huang and Kim [21], the above ratio is asymptotically tight 
even for the special case of a single resource with a degree-r polynomial cost function. 

As a key intermediate step for proving Theorem 4.1, we show the following positive results for 
any cost functions that are, informally speaking, (1) “at most as convex as” degree-r polynomials 
and (2) “at least as convex as” degree-A polynomials, for some parameters r > A > 1. 

Theorem 4.2 Suppose the packing cost function f* satisfies the following conditions: 

1. There exists some r such that for all z £ M”, {Vf*{z),z) < r • f*{z). 

2. There exists some A > 1 such that for all p > I, for all z G M”, Vf*{pz) > p^~^ ■ Vf*{z). 

Then, there is an -competitive online algorithm for the online eonvex packing problem. 

Note that if the cost function f* is a polynomial with maximum degree r and minimum degree 
A > 2, then Theorem 4.2 implies that there is an 0(r)-competitive online algorithm. To further 
prove Theorem 4.1 for polynomials with linear terms, we will explain in Section 4.3 how to handle 
the linear terms and reduce the problem to polynomials of degree at least 2. 
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4.1 Online Convex Packing Algorithm for Theorem 4.2 

The details are described in Algorithm 2 . 

Initialize: x := z := 0; 

while constraint vector = (o^i,..., akn) arrives in round k do 
Set Vk := 0; 

while OkiXi < 1 do 
Continuously increase yk'- 

Simultaneously for each i £ [n], increase Zi at rate ^ = Oki, and 
increase x according to x = V/*(/ 3 T); 

end 

end 

Algorithm 2: Convex online packing 

Here, the vector x plays an auxiliary role and is initialized to 0. Throughout the algorithm, 
we maintain the invariant z = and x = '\/f*{pz) for some parameter p > 1 to be determined 
later. In round k £ [m], the vector Sk = (ofci) • • • j «fcn) is given. The variable yk is initialized to 
0, and is continuously increased while Yi£[n] ^ki^i < 1. To maintain z = A^y, for each i £ [n], Zi is 
increased at rate ^ = Oki- As the coordinates of .?are increased, the vector x is increased according 
to the invariant x = V/*(pT). Since V/* is monotone, as yk increases, both z and z increase 
monotonically. We show in Lemma 4.1 that unless the offline packing problem is unbounded, 
eventually cLkiXi reaches 1, at which moment yk stops increasing and round k finishes. 

Observe that the coordinates of x are increased monotonically throughout the algorithm. Below 
we show that at the end of the process, the constraints X]ie[n] ^ satisfied for all j £ [m]. 
Hence, the vector x is feasible for the covering problem. 

In the rest of the section, for k £ [m], we let z^^'i denote the vector z at the end of round k, 
where z^^'^ := 0 . 

Lemma 4.1 (Dual Feasibility) Recall our assumption that the offline optimal packing objective 
is bounded. Then, in each round k £ [m], eventually we have Yie[n]'^kiXi > 1, and yk will stop 
increasing. 

Proof: During round k £ [m], the algorithm increases yk only when Y'i=i^kiXi < 1. Therefore, 
recalling z = when the algorithm increases yk, it also increases each Zi at rate = Oki. 

Hence, we have 

^ = 1 - {ak,vriz)) > 1 - ^ • {dk,vnpz)) 

~ “ pA-1 ■ {o,k,x) > 1 — pA-1 ) 

where the first inequality follows from the assumption S/f*{pz) > p^~^ ■ S/f*{z), and the last 
inequality follows because {dk,x) < 1 when yk is increased. 

Therefore, suppose for contrary that {dk,x) never reaches 1, then the objective function P{y) 
increases at least at some positive rate 1 — (recalling p > 1 and A > 2) as yk increases, which 
means the offline packing problem is unbounded, contradicting our assumption. ■ 
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4.2 Competitive Analysis for Theorem 4.2 

Lemma 4.2 (Bounding Increase in y) For k £ [m], let denote the vector z at the end of 
round k, where z^^'^ := 0. Then, at the end of round k when stops increasing (by Lemma 4-1) 

In particular, since /*(0) = 0, this implies that at the end of the algorithm, 

k€[m] 


Proof: Recall again that yk increases only when {dk,x) < 1, we have 


dzi 




n 


n 


Hence, integrating this with respect to yk thronghont round k, and observing that x = Vf*{pz), 
we have 

vk > / {vr{pz),dz) = - • inpzC^'^) - r(#-'))), 

where the last equality comes from the fundamental theorem of calculus for path integrals of vector 
fields. ■ 

Proof: [Theorem 4.2] Suppose 2 ^*”^ = A^y is the vector at the end of the algorithm, and = 
V/*(p^™)). Then, we have 

= /(Vr (p2-<™))) < (r - 1 ) • /*(p2-<™)) , 


where the inequality follows from applying Lemma 3.1(c) with the roles of / and f* reversed. 

On the other hand, 

m = ^ J • (4-1) 


Hence, it follows that 


c{x) ^ {T-i)-npz(^^) ^ (r-i)-/-r(2-<-)) ^ 

Piv) ~ i • f*{p^^^) - f*{z(^^) ~ \ -p^- f*{z(^'>) - f*{z(^^) 


p^~^ — 1 


(4.2) 


where the penultimate inequality follows because the assumption Vf*{pz) > p^ ^ ■ V/*( 2 ) implies 
that f*{pz^’^'^) = p f*{pz),d^ > p^ f*{z),d^ = p^ ■ /*(£(”*))), and the function 

* ^ is decreasing. 

Choosing p := and observing that is feasible for the covering problem, we have 

popt p.opt 

< ^ < (t - 1 ) 


P{y)-P{y)- P{y) “ A - 1 

1 , , 1 

The result then follows because A^-i = (1 + (A — 1)) < e. 


22 











4.3 Proof of Theorem 4.1 


Polynomial with Linear Terms. Observe that if a polynomial /* has linear terms, then for 
any A > 1, it does not satisfy the condition that Vf*{pz) > ■ Vf*{z) for any z G and any 

p > 1. We write f*{z) = (c, A) + /*(-?), where c = V/*(()) and f*{z) contains terms with degree at 
least 2. Therefore, for all p > 1, Vf*{pz) > p - Vf*{z). 

Then, the objective function becomes 

P{y) = (1,^ - r{A^y) = {l-Ac,^- fiA^y) . 

Moreover, the corresponding covering problem becomes min,j>o f{x) subject to Ax > 1 — Ac. In 
other words, in round k G [m], as the vector Sk arrives, the covering constraint becomes (ofc, x) > bk, 
where := 1 — (afc,c). If < 0, then the constraint is automatically satisfied, and we set yk = 0 
such that round k finishes immediately. Otherwise, we can run Algorithm 2 using the function f* 
and the constraint vector ^ in round k. 

Ok 

Next, we present a proof of Theorem 4.1 based on the above discussion. 

Proof: [Theorem 4.1] As discussed above, we write c := V/*(0) and f*{x) := f*{x) — {c,x) as 
the convex polynomial containing the terms of f* with degree at least A = 2 . (Observe that if f* 
contains only linear terms, then the problem is trivial because the objective is unbounded when 
there is some round k G [m] such that bf^ > 0.) 

For ease of exposition, we can assume that for each k G [m], 6^ := 1 — (afc,c) > 0. Otherwise, 
we can essentially ignore the variable yk by setting it to 0. We denote B as the diagonal matrix 
whose (k, k)-th entry is bk- By writing w := By, the objective function can be expressed in terms 
of w as P{w) := ( 1 , w) - f*{{B-^Afw). 

Hence, we can run Algorithm 2 using function f* such that in round k G [m], when the vector 
Sk arrives, we can transform it by dividing each coordinate by bk before passing it to the algorithm. 

By Theorem 4.2, using A = 2, it follows that the algorithm has competitive ratio 0{t). ■ 

5 Online Combinatorial Auction with Non-separable Production 
Cost 

In this section, we explain how to extend our algorithm for the online convex packing problem to 
a more general problem known as online combinatorial auction with production cost. 

In an online combinatorial auction, there is a seller with n types of items (i.e., resources) that 
are known upfront and a convex production cost function g = f* '■ M” —^ ]R_|_. Producing Zi units 
of resource i for all i G [n] incurs a production cost of f*{z). There are m buyers (i.e., requests) 
that arrive online. Each buyer j is associated with a value function® Vj : 2l"’] —)• such that Vj{S) 

is buyer j's value for getting a subset of items SC [n]. On the arrival of a buyer, the seller must 
choose a subset of items Sj to allocate to the buyer immediately without any information of future 
buyers. The objective is to maximize the social welfare, which is defined as total value of buyers, 
minus the production cost f*{z), where Zi is the number of j's such that i G Sj. 

We assume that /* have the same properties as in Section 4, i.e., f* is convex and differentiable, 
and both /* and V/* are monotone, and /*(0) = 0. In addition, we shall consider a couple more 

®For efficiency issues, we might need to assume that Vj is supermodular (i.e., for subsets A and B, Vj{A)+Vj{B) < 
Vj{A n B) + Vj{A U B)), but otherwise we do not need further assumptions. 
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technical assumptions on /*, which are true for most interesting functions such as polynomials. 

Theorem 5.1 Suppose the cost function f* is a convex polynomial with non-negative coefficients, 
zero constant term, and maximum degree r. Then, there is an 0{t)- competitive online algorithm 
running for the online combinatorial auction problem with production cost f*. 

Consider the following standard convex program relaxation of combinatorial auction with pro¬ 
duction costs and its Fenchel dual program: 

Piv) ■= EjeH E5c[n] ■ VjS - f*iz) s.t. 

yjS Zi 
Ssc[n] yjS — 1 

C{x, u) := /(f) + EjGH "“j s.t. 

Observe that in the packing problem, the objective function can be succinctly expressed as 
P{y) := {v, y) — f*{A'^y), where the coordinates of v are indexed by [m] x 2[”1, and A is the {0,1}- 
matrix whose rows are indexed by [m] x 21”! and columns are indexed by [n] such that for j G [m], 
S C [n] and i G [n], the (jS",i)-th entry is 1 iff i ^ S. We also denote xg '■= 

In this paper, we present an online algorithm that solves the above convex program fractionally. 
In each round k G [m], the value function Vk arrives, and the algorithm irrevocably chooses non¬ 
negative values for y^g for all S C [n] such that Ylig(z[n\ykS < 1- Observe that the algorithm 
knows the rows of A in advance, although it may not know the number m of rounds. Translating 
fractional algorithms into integral one is relatively straightforward and readers are referred to Huang 
and Kim [21] for details. The algorithm is given in Algorithm 3. 

Explanation of Algorithm. In each round k G [m], the variables are changed continuously as 
functions of some time parameter t G [0,1] such that t = 0 corresponds to the beginning of round k, 
and round k finishes when t reaches 1. Initially, yk{0) := 0 and Ufc(O) := 0. The following invariants 
are maintained. 

Invariant 1: The sum yksi^) increases at rate 1 with respect to t. 

In fact, the algorithm ensures that at any time t, there is exactly one S C [n] such that 
ykg is increased at rate 1. This ensures that when t reaches 1, the sum Ylgc[n]yksi't) is 1 to 
maintain the feasibility of y. To decide which ykg^s to increase at time t, a parameter is defined 
r{t) := maxyic[n] 7A(t), where jAit) ■= Vk{A) — XA{t). Intuitively, the sets S that attain r{t) are 
the most worthwhile to be selected. One technical issue is that whether r(t) can be computed 
efficiently. If the set function is supermodular, then r(t) can be computed efficiently. 

Invariant 2: A parameter e > 0 is chosen such that a variable y^g is increased at time t only if 
7s(i) > r{t) - e. 

We shall see that this invariant is used to bound the competitive ratio. One might attempt to 
define Invariant 2 with e = 0. The problem is that when y^g is increased, the vectors z = and 
X := Vf*{pz) will also be increased such that the set S might no longer satisfy 7 s(t) > r{f), even 
when t is increased infinitesimally. To choose the value of e and facilitate the implementation of 
the algorithm, we place some technical assumptions on /*, which are true for interesting functions. 


max 

y>o 

Vi G [n] : 
V/ G [m] : 

min 

x,'U>0 

Vj G [m],VS' C [n] : 
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Initialize: x := z := 0; 

while buyer k with value function arrives in round k do 
Set yk := 0 and Uk ■= 0; 

while t increases continuously from 0 to 1 do 

Define r{t) := maxAc[n] 7A(i), where 7A(t) := Vk{A) - 
Pick any S Q [n] that attains r{t). 
while 75 (t) > r(t) — e do 
Increase t continuously: 

1. Increase yks at rate = 1. 

2. To maintain the invariant z = A"^y, 

for each i £ S, Zi is increased at rate ^ = 1. 

3. As z is increased, we maintain x := \i'f*{pz) for some parameter p> 1. 

4. Increase Uk at rate = r{t). (Note that Uk is for analysis only.) 

end 

end 

end 

Algorithm 3: Online combinatorial auction with production cost 


• The gradient V/* is locally Lipschitz with respect to the £i-norm, i.e., for all z, for all R > 0, 
there exists some L such that \\z — a||i, \\z — 6||i < R implies that ||V/*(a) — V/*(6)||i < 
L • ||a — 6||i. 

f* (z) 

• For all R > 0, the infimum inf||^|^>j; || ^ || ^'^ is positive. This assumption means that the 
production cost cannot be zero as long as some resource is being used, and must grow at least 
proportionately as more resources are used. 

1 f*(z^ 

We shall see that it is sufficient to choose e := inf||^||^>^ IMi ^ depends on the 

first value function vi and the local Lipschitz constant of V/* around 0. 

Continuous vs Discrete Increments. Observe that during round k, the algorithm needs to 
change the yks to increase when jsi't) < ^(t) — this means that 'jsit) must have decreased by at 
least e from the time S is selected. Moreover, observe that r{t) > 70 (t) > 0. Hence, it follows that 
there can be at most 2"' • changes of S before t reaches 1. 

We show that the local Lipschitz property of V/* implies that instead of monitoring r{t) 
continuously, the algorithm can be implemented by discrete increments, even though it is more 
convenient to analyze it continuously. 

Notice that in round k, the £i-norm of the vector z = A"^y can increase by at most n, which 
happens if the complete set [n] is chosen throughout. Hence, it follows that ||p.?||i can change by at 
most pn according to the ^i-distance. Let L be the local Lipschitz constant for V/* in this vicinity 
of pz during round k. Observe that the mapping t 1—)• x{t) := Vf*{pz{t)) is Lpn-Lipschitz with 
respect to the £i-norm. 

Therefore, if t is increased by S := ||x(t)||i can increase by at most s. Hence, it follows 

that if 75(to) = then for all t G [to, to + 5], we have 75(t) > 75(^0) — e > '>’{t) — s, since r{t) 

is non-increasing. As a result, we can increase t at increments of 6, and compute r(t) and change 
subsets S for only | times before t reaches 1, and round k finishes. 
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Feasibility of Covering Problem. Observe that to maintain the feasibility of {x,u), at the end 
of round k, one can simply set Uk ■= max 5 c[n] I’kiS) — xs- However, to facilitate the competitive 
analysis, we increase Uk at rate = r(t) throughout round k. The next lemma shows that this 
also maintains the feasibility of (x, u). 

Lemma 5.1 (Feasibility of (x, ft)) After each round k E [m], for all S C [n], Uk > Vk{S) — xs- 

Proof: Recall that the time parameter t S [0,1] denotes the beginning of round k with t = 0 

and the end of round k with t = 1. Observing that x is increased monotonically, for all t G [0,1], 
xs{t) < a;s(l). 

Hence, we have = r{t) > Vk{S) — xs{t) > Vk{S) — X 5 (l). Integrating with respect to t from 
0 to 1 gives the result. ■ 

Competitive Analysis. The analysis is along the same lines as that of the convex packing 
problem. The main difference lies in how we bound the increase in y. Concretely, we will use the 
following lemma, which is an analogue of Lemma 4.2 in the convex packing problem. 

Lemma 5.2 (Bounding Increase in y) For k G [m], let denote the vector z at the end of 
round k, where := 0. Then, at the end of round k, 

^ VkiS) ■ Vks > Uk + y {f*{p^^'^) - f*{p^^~^'^)) - eRk , 

SC[n] 

where Rk is the amount of time in [0,1] during round k in which some non-empty set S is chosen 
to increase the variable yks- 

In particular, since f*{0) = 0, this implies that at the end of the algorithm, 

{v,^= Y + - ^Y^i ■ 

j£[m]SQ[n] jeH jG[m] 

Proof: Recall that by Invariant 2, at time t G [0,1], yks increases only if 75 (t) > r[t) — e. On the 
other hand, observe that if the empty set S' = 0 is chosen such that yk% is increased, both z and x 
remains the same. Hence, the invariant 70 (t) = r{t) is actually maintained with no error term. 

We define the indicator function y : [0,1] —)■ {0,1} such that x(t) = 1 iff a. non-empty set S is 
chosen to increase yks at time t. Then, we have 75 (t) > r{f) — e ■ x{t) 

Recall that = r{t). Hence, if yks is increased at time t, we have 

Vk{S) > ^ + xsit) - e ■ xit)- 

By Invariant 1, X]s'c[n] ~ Observe that for other S' / S, = q. Hence, we can 

multiply the above equation by = 1 , and include the zero terms in the sum for S' 7 ^ S to 
obtain the following. 


Y ■ 

5C[n] 


dVkS 

dt 


> 



5C[n] iGS 


dVkS 

dt 


£ • x{t) 


duk 

dt 


dzi 


le n 
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where the last equality follows from interchanging the order of summation, and ^ = X^5c[n] ieS 

Observe that x = Vf*{pz). Hence, integrating with respect to t from 0 to 1, the vector z 
increases from to z^^\ and we have: 

VkiS)-yks >Uk+ / {Vf*{pz),dz)-£-Rk=Uk^ - {f*{pz^’'^) - f*{pz^’'~^^)) - e ■ Rk , 

where Rk '■= Jq xi't)dt is the amount of time in which a non-empty set S is chosen to increase yks, 
and the last equality comes from the fundamental theorem of calculus for path integrals of vector 
helds. ■ 

Proof: [Theorem 5.1] We hrst use the trick in Section 4.3 to absorb the linear terms of f* into 
(n, ^ in the objective function P{y) = (n, ^ — f*{A^y). Hence, we can assume that each term in 
/* has degree at least 2 . 

By Lemma 5.2, after round m, we have 

P{^ = {V,^- >(!,«) + J • r{pz^^'>) - - eR, 

where R := Ejelm] 

Recalling that x = C{x,u) := {l,u) + f{Vf*{pz^^^)) < (1, n) + (r - 1) • 

where the last inequality follows from Lemma 3.1(c), since the polynomial f* is has maximum 
degree r. 

Hence, similar to inequality (4.2) in the proof of Theorem 4.2, we have: 


C{x,u) ^ 

~PW - 




< max{l, 


(r-iy 1 


Observe that the second argument of the maximum operator is exactly the same expression 
appearing in inequality (4.2) apart from the negative eR error term in the denominator. We nse 
the assumption that every term in f* has degree at least A = 2 and set p := 2. 

We next show how to choose e > 0 such that eR < ^ • f*{z^^')). Given the hrst value function 
vi, we can give a lower bonnd on ||.z^^)||i using the local Lipschitz property of /*. 

Let /3 := max 5 c[n] — 'i'i(0). We can assnme /3 > 0; otherwise, the empty set will be chosen 
in this round, and z and x remains zero. 

Observe that as t increases from 0 to 1, \\pz\\i < pn. Suppose L is the local Lipschitz constant of 
V/* in the £i-ball of radins pn around 0. Then, the function z ^ x '.= V/*(/ 9 z) has local Lipschitz 
constant pL. 

Observe that regardless of the value of e, if the empty set is ever chosen, the norm ||x||i must 
have increased by at least /?, which means ||.?||i has increased by at least On the other hand, if 
the empty set is never chosen, then as t increases, HzHi increases at rate at least 1. Hence, in any 
case, no matter what the value of e is, at the end of the hrst round, > Rq '■= min{l, ^}. 

1 f*iz^ 

Hence, after seeing the hrst value function ui, the algorithm can choose e := ^ inf|| 2 ^|^>^g IMi ' 
Observing that z increases monotonically throughout the algorithm, at the end of round m, 
11 ^™^111 > Ro- Hence, we have eR < ^ ■ ■ R < ^ where the last inequality 
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follows because during the time period of measure R in which a non-empty set is chosen, ||z||i 
increases at rate at least 1. 

Therefore, by a similar analysis as in the proof of Theorem 4.2, the competitive ratio is 0(t), 
as required. ■ 

References 

[1] Noga Alon, Baruch Awerbuch, Yossi Azar, Niv Buchbinder, and Joseph Seffi Naor. A general 
approach to online network optimization problems. ACM Transactions on Algorithms (TALC), 
2(4):640~660, 2006. 

[2] James Aspnes, Yossi Azar, Amos Fiat, Serge Plotkin, and Orli Waarts. On-line routing of 
virtual circuits with applications to load balancing and machine scheduling. Journal of the 
ACM (JACM), 44(3):486-504, 1997. 

[3] Baruch Awerbuch, Yossi Azar, Edward F Grove, Ming-Yang Kao, P Krishnan, and Jef¬ 
frey Scott Vitter. Load balancing in the 1 p norm. In Foundations of Computer Science, 
1995. Proceedings., 36th Annual Symposium on, pages 383-391. IEEE, 1995. 

[4] Yossi Azar, Umang Bhaskar, Lisa Fleischer, and Debmalya Panigrahi. Online mixed packing 
and covering. In Proceedings of the Twenty-Fourth Annual ACM-SIAM Symposium on Discrete 
Algorithms, pages 85-100. SIAM, 2013. 

[5] Yossi Azar, Ilan Reuven Cohen, and Debmalya Panigrahi. Online covering with convex objec¬ 
tives and applications. arXiv preprint arXiv:1412.3507, 2014. 

[6] Nikhil Bansal, Niv Buchbinder, and Joseph Seffi Naor. A primal-dual randomized algorithm 
for weighted paging. Journal of the ACM (JACM), 59(4):19, 2012. 

[7] Yair Bartal, Rica Gonen, and Noam Nisan. Incentive compatible multi unit combinatorial auc¬ 
tions. In Proceedings of the 9th conference on Theoretical aspects of rationality and knowledge, 
pages 72-87. ACM, 2003. 

[8] Kshipra Bhawalkar, Sreenivas Gollapudi, and Debmalya Panigrahi. Online set cover with set 
requests. Approximation, Randomization, and Combinatorial Optimization. Algorithms and 
Techniques (APPROX/RANDOM 2014), 28:64-79, 2014. 

[9] Avrim Blum, Anupam Gupta, Yishay Mansour, and Ankit Sharma. Welfare and profit max¬ 
imization with production costs. In Foundations of Computer Science (FOCS), 2011 IEEE 
52nd Annual Symposium on, pages 77-86. IEEE, 2011. 

[10] Stephen Boyd and Lieven Vandenberghe. Convex optimization. Cambridge university press, 
2009. 

[11] Niv Buchbinder, Shahar Chen, Anupam Gupta, Viswanath Nagarajan, and Joseph (Seffi) 
Naor. Online packing and covering framework with convex objectives. arXiv preprint 
arXiv: 1412 . 8347 , 2014. 

[12] Niv Buchbinder and Rica Gonen. Incentive compatible mulit-unit combinatorial auctions: A 
primal dual approach. Algorithmica, pages 1-24, 2013. 


28 


[13] Niv Buchbinder and Joseph Naor. The design of competitive online algorithms via a primal: 
dual approach. Foundations and Trends in Theoretical Computer Science^ 3(2-3):93-263, 2009. 

[14] Niv Buchbinder and Joseph Naor. Online primal-dual algorithms for covering and packing. 
Mathematics of Operations Research, 34(2):270-286, 2009. 

[15] Nikhil R Devanur and Zhiyi Huang. Primal dual gives almost optimal energy efficient online 
algorithms. In SODA, pages 1123-1140. SIAM, 2014. 

[16] Nikhil R Devanur and Kamal Jain. Online matching with concave returns. In Proceedings of 
the forty-fourth annual ACM symposium on Theory of computing, pages 137-144. ACM, 2012. 

[17] Nikhil R Devanur, Kamal Jain, and Robert D Kleinberg. Randomized primal-dual analysis 
of ranking for online bipartite matching. In Proceedings of the Twenty-Fourth Annual ACM- 
SIAM Symposium on Discrete Algorithms, pages 101-107. SIAM, 2013. 

[18] Anupam Gupta, Ravishankar Krishnaswamy, and Kirk Pruhs. Online primal-dual for non¬ 
linear optimization with applications to speed scaling. In Approximation and Online Algo¬ 
rithms, pages 173-186. Springer, 2013. 

[19] Anupam Gupta and Viswanath Nagarajan. Approximating sparse covering integer programs 
online. Mathematics of Operations Research, 2014. 

[20] Zhiyi Huang. Sigact news online algorithms column 25: Online primal dual: Beyond linear 
programs. ACM SICACT News, 45(4):105-119, 2014. 

[21] Zhiyi Huang and Anthony Kim. Welfare maximization with production costs: A primal dual 
approach. In SODA, pages 59-72. SIAM, 2015. 

[22] Frances Yao, Alan Demers, and Scott Shenker. A scheduling model for reduced cpu energy. 
In Foundations of Computer Science, 1995. Proceedings., 36th Annual Symposium on, pages 
374-382. IEEE, 1995. 


29 


